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Abstract 



Belief Propagation (BP) is a message-passing algorithm that computes the exact marginal distributions at every 
vertex of a graphical model without cycles. While BP is designed to work correctly on trees, it is routinely applied 
to general graphical models that may contain cycles, in which case neither convergence, nor correctness in the case 
of convergence is guaranteed. Nonetheless, BP gained popularity as it seems to remain effective in many cases of 
interest, even when the underlying graph is "far" from being a tree. However, the theoretical understanding of BP 
(and its new relative Survey Propagation) when applied to CSPs is poor. 

Contributing to the rigorous understanding of BP, in this paper we relate the convergence of BP to spectral prop- 
erties of the graph. This encompasses a result for random graphs with a "planted" solution; thus, we obtain the first 
rigorous result on BP for graph coloring in the case of a complex graphical structure (as opposed to trees). In par- 
ticular, the analysis shows how Belief Propagation breaks the symmetry between the 3! possible permutations of the 
color classes. 

Keywords: Belief Propagation, Survey Propagation, graph coloring, spectral algorithms. 

1 Introduction and Results 

1.1 Message Passing Algorithms 

This paper deals with a rigorous analysis of the Belief Propagation ("BP" for short) algorithm on certain instances of 
the 3-coloring problem. Originally BP was introduced by Pearl |[T4l as a message passing algorithm to compute the 
marginals at the vertices of a probability distribution described by an acyclic "graphical model", i.e., a representation 
of the distribution's dependency structure as an acyclic graph. Although in the worst case BP will fail if the graphical 
representation features cycles, various version of BP are in common use as heuristics in artificial intelligence and 
statistics, where they frequently perform well empirically as long as the underlying model does at least not contain 
(many) "short" cycles. However, there is currently no general theory that could explain the empirical success of BP 
(with the notable exception of the use of BP in LDPC decoding lfTTl[T2l[T5l ). 

A striking recent application of BP is to instances of NP-hard constraint satisfaction problems such as 3-SAT or 
3-coloring; this is the type of problems that we are dealing with in the present work. In this case the primary objective 
is not to compute the marginals of some distribution, but to construct a solution to the constraint satisfaction problem. 
For example, BP can be used to (attempt to) compute a proper 3-coloring of a given graph. Indeed, empirically BP 
(and its sibling Survey Propagation "SP") seems to perform well on problem instances that are notoriously "hard" for 
other current algorithmic approaches, including the case of sparse random graphs. 
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For instance, let G(n,p) be the random graph with vertex set V = {1, . . . , n} that is obtained by including each 
possible edge with probability < p — p(n) < 1 independently. Thus, the expected degree of any vertex in G(n,p) 
is {n — l)p ~ np. Then there exists a threshold r = r(n) such that for any e > the random graph G(n,p) is 
3-colorable with probability 1 — o(l) if np < (1 — e)r, whereas G(n,p) is not 3-colorable if > (1 + e)r 0"). In 
fact, random graphs G(n,p) with average degree just below r were considered the example of "hard" instances of 
the 3-coloring problem, until statistical physicists discovered that BP/SP can solve these graph problems efficiently in 
a regime considered "hard" for any previously known algorithms (possibly right up to the threshold density) J4]|6l. 
While there are exciting and deep arguments from statistical physics that provide a plausible explanation of why 
these message passing algorithms succeed, these arguments are non-rigorous, and indeed no mathematically rigorous 
analysis is currently known. 

The difficulty in understanding the performance of BP/SP on G(n,p) actually lies in two aspects. The first aspect 
is the combinatorial structure of the random graph G(n,p) with respect to the 3-coloring problem, which is not very 
well understood. In fact, even the basic problem of obtaining the precise value of the threshold t is one of the current 
challenges in the theory of random graphs. Furthermore, we lack a rigorous understanding of the "solution space 
geometry", i.e., the structure of the set of all proper 3-colorings of a typical random graph G(n,p) (e.g., how many 
proper 3-colorings are there typically, and what is the typical Hamming distance between any two). But according to 
the statistical physics analysis, the solution space geometry affects the behavior of BP significantly. 

The second aspect, which we focus on in the present work, is the actual BP algorithm: given a graph G, how/why 
does the BP algorithm "construct" a 3-coloring? Thus far there has been no rigorous analysis of BP that applies to 
graph coloring instances except for graphs that are globally tree-like (such as trees or forests). However, it seems 
empirically that BP performs well on many graphs that are just locally tree like (i.e., do not contain "short" cycles). 
Therefore, in the present paper our goal is to analyze BP rigorously on a class of graphs that may have a complex 
combinatorial structure globally, but that have a very simple solution space geometry. More precisely, we shall relate 
the success of BP to spectral properties of the adjacency matrix of the input graph. In addition, we point out that the 
analysis comprises a natural random graph model (namely, a "planted solution" model). 

1.2 Belief Propagation and Spectral Techniques 

The main contribution of this paper is a rigorous analysis of BP for 3-coloring. We basically show that if a certain 
(simple) spectral heuristic for 3-coloring succeeds, then so does BP. Thus, the result does not refer to a specific random 
graph model, but to a special class of graphs - namely graphs that satisfy a certain spectral condition. More precisely, 
we say that a graph G = (V, E) on n vertices is (d, e)-regular if there exists a 3-coloring of G with color classes 
Vi,V2, V3 such that the following is true. Let ly i £ R y be the vector whose entries equal 1 on coordinates v € Vi, 
and on all other coordinates; then 

Rl. for all 1 < i < j < 3 the vector ly i — ly. is an eigenvector of the adjacency matrix A(G) with eigenvalue —d, 
and 

R2. iff ± l Vi foralH = 1,2, 3, then || A(G)£ || < ed||f ||. 

We shall state a few elementary properties of (d, e)-regular graphs in Proposition QT]below (assuming that e is suffi- 
ciently small - e < 0.01, say). For instance, we shall see that (d, e)-regularity implies that each vertex v <G Vi has 
precisely d neighbors in each other color class Vj (i 7^ j). Moreover, (Vi, V2, V3) is the only 3-coloring of G (up to 
permutations of the color classes, of course), and for each pair i 7^ j the bipartite graph consisting of the Vj-Vj -edges 
is an expander. 

Furthermore, if a graph G is (d, e)-regular for any e < 0.01, say, then the following spectral heuristic is easily seen 
to produce a 3-coloring. 

1. Compute a pair of perpendicular eigenvectors x 1 ,^ 2 <= R y of A(G) with eigenvalue —d. 

2. Define an equivalence relation w on V by letting v w w iff Xv — X % w f° r * = 1, 2. Output the equivalence 
classes of ps as a 3-coloring of G. 
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The equivalence classes of « are precisely the three color classes Vi, Vz, V3. For if v, w belong to the same color class, 
then their entries in all three vectors 1^ — ly. (i < j) coincide; hence, as the space spanned by these vectors contains 
X 1 , X 2 , we have v m w. Conversely, if v w w, then the entries of v and w in all the vectors \y t — ly. coincide, because 
these vectors lie in the space spanned by x 1 , X 2 ; consequently, v, w belong to the same color class 14. 

The main result of this paper is that BP can 3-color (d, 0.01)-regular graphs in polynomial time, provided that d 
is not too small and the number of vertices is sufficiently large. We defer the description of the actual (randomized, 
polynomial time) BP coloring algorithm BPCol, which the following theorem refers to, to Section[2] 

Theorem 1. There exist constants do, k > such that for each d > do there is a number no — no(d) so that the 
following holds. If G = (V, E) is a (d, 0.01)- regular graph on n — \V\ > no vertices, then with probability > Kn~ l 
over the coin tosses of the algorithm, BPCol(G) outputs a proper 3-coloring of G. 

Observe that Theorem[T]deals with "sparse" graphs, since the lower bound no on the number of vertices depends 
on d. The proof yields an exponential dependence, i.e., no = cxp(0(d)). Conversely, this means that the average 
degree of G is at most logarithmic in n, which is arguably the most relevant regime to analyze BP (cf. Section [2]). 
Moreover, by applying BPCol 0(n) times independently, the success probability can be boosted to 1 — a for any 
a > 0. Besides, there is an easy way to modify the (initialization step of) BPCol so that the success probability of 
one iteration is at least k (rather than ku" 1 ), cf. Remark[4]for details. 

Let us emphasize that the contribution of Theorem[T]is not that we can now 3-color a class of graphs for which 
no efficient algorithms were previously known, as the aforementioned spectral heuristic 3-colors (d, 0.01)-regular 
graphs in polynomial time. Instead, the new aspect is that we can show that the Belief Propagation algorithm 3-colors 
(d, 0.01)-regular instances, thus shedding new light on this heuristic. Indeed, the proof of Theorem Q] which we 
present in Section[3] shows that in a sense BPCol "emulates" the spectral heuristic (although no spectral techniques 
occur in the description of BPCol). Thus, we establish a connection between spectral methods and BP. Besides, 
we note that no "purely combinatorial" algorithm (that avoids the use of advanced techniques such as Semidefinite 
Programming or spectral methods) is known to 3-color (d, 0.01 )-regular graphs. 

To illustrate TheoremQ] and to provide an example of (d, 0.01)-regular graphs, we point out that the main result 
comprises a regular random graph model with a "planted" 3-coloring. Let G n ,d,3 be the random graph with vertex set 
V = {1, . . . , 3n} obtained as follows. 

1 . Let V\ , V2 , V3 be a random partition of V into three pairwise disjoint sets of equal size. 

2. For any pair 1 < i < j < 3 independently choose a rf-regular bipartite graph with vertex set V^U Vj uniformly 
at random. 

For a fixed d we say that G n ,d,3 has a certain property V with high probability ("w.h.p"), if the probability that G Ut d,3 
enjoys V tends to 1 as n — > 00. Concerning G n> d,3, Theorem[T]implies the following. 

Corollary 2. Suppose that d > do is fixed. With high probability a random graph G = G n ,d,3 has the following 
property: with probability > nn^ 1 over the coin tosses of the algorithm, BPCol(G) outputs a proper 3-coloring of 
G. 

To prove CorollarylH we show that w.h.p. G n .d,3 is (d, 0.01) -regular, cf. Section|4] 

1.3 Related Work 

Alon and Kahale |2] were the first to employ spectral techniques for 3-coloring sparse random graphs. They present a 
spectral heuristic and show that this heuristic finds a 3-coloring in the so-called "planted solution model". This model 
is somewhat more difficult to deal with algorithmically than the G n ^d.3 model that we study in the present work. For 
while in the Gn.d^-model each vertex v E Vi has exactly d neighbors in each of the other color classes Vj ^ Vi, in the 
planted solution model of Alon and Kahale the number of neighbors of v € Vi in Vj has a Poisson distribution with 
mean d. In effect, the spectral algorithm in J2) is more sophisticated than the spectral heuristic from Section [L2l In 
particular, the Alon-Kahale algorithm succeeds on (d, 0.01)-regular graphs (and hence on G n ^d.3 w.h.p.). 

There are numerous papers on the performance of message passing algorithms for constraint satisfaction problems 
(e.g., Belief Propagation/Survey Propagation) by authors from the statistical physics community (cf. J4] [5] [TO) and 



3 



w 



V 




N(v) \ {w} 



Figure 1 : the BP equation. 



the references therein). While these papers provide rather plausible (and insightful) explanations for the success 
of message passing algorithms on problem instances such as random graphs G n . p or random fc-SAT formulae, the 
arguments (e.g., the replica or the cavity method) are mathematically non-rigorous. To the best of our knowledge, no 
connection between spectral methods and BP has been established in the physics literature. 

Feige, Mossel, and Vilenchik |8| showed that the Warning Propagation (WP) algorithm for 3-SAT converges in 
polynomial time to a satisfying assignment on a model of random 3-SAT instances with a planted solution. Since 
the messages in WP are additive in nature, and not multiplicative as in BP, the WP algorithm is conceptually much 
simpler. Moreover, on the model studied in |8] a fairly simple combinatorial algorithm (based on the "majority vote" 
algorithm) is known to succeed. By contrast, no purely combinatorial algorithm (that does not rely on spectral methods 
or semi-definite programming) is known to 3-color G n ,d.3 or even arbitrary (d, 0.01)-regular instances. 

A very recent paper by Yamamoto and Watanabe [16] deals with a spectral approach to analyzing BP for the Min- 
imum Bisection problem. Their work is similar to ours in that they point out that a BP-related algorithm pseudo-bp 
emulates spectral methods. However, a significant difference is that pseudo-bp is a simplified version of BP that 
is easier to analyze, whereas in the present work we make a point of analyzing the BP algorithm for coloring as it is 
stated in J4) (cf. Remark [8] for more detailed comments). Nonetheless, an interesting aspect of [16j certainly is that 
this paper shows that BP can be applied to an actual optimization problem, rather than to the problem of just finding 
any feasible solution (e.g., a fc-coloring). 

The effectiveness of message passing algorithms for amplifying local information in order to decode codes close 
to channel capacity was recently established in a number of papers, e.g. ifTTl [121 [T5l . Our results are similar in flavor, 
however the analysis provided here allows to recover a proper 3-coloring of the entire graph, whereas in the random 
LDPC codes setting, message passing allows to recover only a 1 — o(l) fraction of the codeword correctly. In lfl2l it 
is shown that for the erasure channel, all bits may be recovered correctly using a message passing algorithm, however 
in this case the message passing algorithm is of combinatorial nature (all messages are either or 1) and the LDPC 
code is designed so that message passing works for it. 



Following [4], in this section we will describe the basic ideas behind the BP algorithm. Since BP is a heuristic based on 
non-rigorous ideas (mainly from artificial intelligence and/or statistical physics), the discussion of its main ideas will 
lack mathematical rigor a bit; in fact, some of the assumptions that BP is based on (e.g., "asymptotic independence") 
may seem ridiculous at first glance. Nonetheless, as we pointed out in the introduction, BP makes up for this by being 
very successful empirically. At the end of this section, we will state the version of BP that we are going to work with 
precisely. 

The basic strategy behind the BP algorithm for 3-coloring is to perform a fixed point iteration for certain "mes- 
sages", starting from a suitable initial assignment. In the case of 3-coloring the messages correspond to the edges of 
the graph and to the three available colors. More precisely, to each (undirected) edge {v, w} of the graph G = (V, E) 
and each color a € {1, 2, 3} we associate two messages r/^ w from v to w about a, and rj^^ from w to v about a; 
in general, we will have r)%^ w ^ vZ^v Thus, the messages are directed objects. Each of these messages r)%^ w is a 
number between and 1, which we interpret as the "probability" that vertex v takes the color a in the graph obtained 
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from G by removing w. Here "probability" refers to the choice of a random (proper) 3-coloring of G — w, while the 
graph G is considered fixed. (There is an obvious symmetry issue with this definition, which we will discuss shortly.) 

Having introduced the variables 1]%_> W , we can set up the Belief Propagation Equations for coloring, which are the 
basis of the BP algorithm. The BP equations reflect a relationship that the probabilities r)y_ ytv should (approximately) 
satisfy under certain assumptions on the graph G, namely that 

r lv->w — „ 3 „ lb K ' 

for all edges {v, w} of G and all a G {1, 2, 3} (cf. Figure [T). 

The idea behind ( 12.1b is that v takes color a in the graph G — w iff none of its neighbors u G N(v) \ w has color 
a in G — v. Furthermore, the probability of this event ("no u has color a") is assumed to be (asymptotically) equal to 
the product YiueN(v)\w 1 — Vu->v °f me individual probabilities; that is, the neighbors u ^ w of v are assumed to be 
asymptotically independent. Of course, this assumption does not hold for arbitrary graphs G. Finally, the numerator 
on the r.h.s. of (12. Il l is just a normalizing term, which ensures that Y^,a=i Vv~*w = 1- 

The reason why in the above discussion we refer to the probability that v takes color a in the graph G — w obtained 
by removing w rather than just to the probability that v takes color a in G is that in the latter case the neighbors 
u G N(v) would never be (asymptotically) independent - not even if G is a tree. For in this case the presence of v — 
more precisely, the existence of the short path (it, v, u') for any two neighbors u, u' G N(v) of v - would render the 
colors within the neighborhood N(v) heavily dependent. Similarly, if G contains triangles, so that for some vertices 
v the neighborhood N(v) is not an independent set, then the independence assumption that is implicit in ( 12. Il l will 
be violated. Nonetheless, if G does not feature (many) short cycles - say, all the cycles are of length f2(log |V|) as 
| V\ — > oo - then the BP equations (12.1b may at least be asymptotically valid. The random graph model G n .d,3 provides 
an example of graphs (essentially) without such short cycles. 

Now, the basic idea behind the BP algorithm is the following. We start with a "reasonable" initial assignment 
Vv^w (0) an d use ED to perform a fixed point iteration by letting 

a fj , U _ _ U.ueN(v)\{w} 1 ~ <->u(0 
Vv-hhV + - 3 n j _ b (]) 

l^b=l llueN(v)\{w} 1 'lu-tvVJ 

for all {v, w} £ E and a G {1, 2, 3}. As soon as some of the values r)%_ yw (l + 1) are strongly "biased" toward either 
or 1, we try to exploit this information to obtain a coloring. 

Before we state the BP algorithm precisely, we need to discuss an important issue with the BP equations ( 12.11 ). 
Namely, in the case of 3-coloring the set of all 3-colorings is symmetric under permuting the color classes. Therefore, 
if we actually define rft^ w to equal the probability w.r.t. a random 3-coloring of G — w, then trivially t]^_^ w = | 
for all a, v, w. In fact, this trivial solution is actually a fixed point of (12. 2\ . Hence, we need to "break symmetry". In 
particular, it is not a good idea to choose the initial assignment t]^ w (0) = | for all a, v, w. Therefore, we do not start 
from T]y_^ w (0) = |, but we assign to each rj%^ w the value i plus a small random error 5. The hope is that this random 
error will cause the fixed point iterations ( |2.2t to converge to a non trivial fixed point (other than ^^^(0) = | for 
all a, v, w), and that this fixed point yields sufficient information to 3-color G. For instance, if % : V — > {1, 2, 3} is a 
3-coloring of G, then 

<^ = { J oth X erwis = e a (a = 1, 2, 3; {v, w} G E) 

is a fixed point of (12.2b . and clearly the 3-coloring x can be read out of the above messages easily. The algorithm 
BPCol is shown in Fig. [2] Observe that Step 1 ensures that 

3 

(0) = 1 for all {«,«;} EE. (2.3) 

a=l 

Remark 4. Theorem Q] states that the probability (over the random decisions in Step 1) that BPCol yields a proper 
3-coloring of its (d, 0.01)-regular input graph is This can be boosted to 51(1) by means of the following 
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Algorithm 3. BPCol(G) 

Input: A graph G = (V, E). Output: An assignment of colors to the vertices of G. 

1. Let 5 = exp(- log 3 n). 

For each v eV perform the following independently: 

choose a e {1, 2, 3} uniformly at random and assign rj^ w (0) = \ +6 and ^^,(0) = ± - § 
for all b e {1, 2, 3} \ {a} and tu e AT(V). 

2. For / = l,...,r = [log 4 nl 

compute Vv^m(l + 1) using J2.21 for all a, w, and -u;. 

3. For each v e V and each a e {1, 2, 3} compute /3£ = |A^(«)| _1 EueivM 1 ~ vZ->v(l*)- 
Assign to each v e V a color a e {1, 2, 3} such that /?" = max,,^^^} 

Figure 2: the algorithm BP Col. 

slightly more careful initialization. Instead of choosing a random a for each v E V independently, we choose a 
random permutation a of V and let W = {u((a — l)n/3 + 1), . . . , a (an/ 3)} (a = 1, 2, 3). Then, for each v € W a 
we set 77SL W (0) = \ + <5 and 7/^(0) = | — | (b € {1,2,3} \ {a}, w G N(v)). The proof of Proposition [T2 
below shows that this leads to a success probability of 0(1). Nonetheless, we chose to state BPCol with independent 
decisions in its initalization, because this appears more natural (and generic) to us. 

Remark 5. Although in the above discussion of the BP equation ( 12.2b we referred to "local" properties (such as the 
absence of short cycles), such local properties will not occur explicitly in our analysis of BPCol. Indeed, relating 
BPCol to spectral graph properties, the analysis has a "global" character. Nonetheless, various local conditions (e.g., 
a relatively small number of short cycles) are implicit in the "global" assumption that the graph G is (d, 0.01)-regular 
(cf. TheoremQ]). For more background on spectral vs. combinatorial graph properties cf. Chung and Graham 0. 

Remark 6. BPCol updates the messages r)%_ yw "in parallel", i.e, the messages carry "time stamps" (cf. ( 12.21 i). An 
alternative, equally common option would be "serial" updates, e.g., by choosing each time a random pair v, w of 
adjacent vertices along with a color ae {1,2,3} and updating i]y^ w via ( 12.11 ). 

Remark 7. BPCol exploits the result of the fixed point iteration (12. 2t in a more straightforward fashion than the 
version of BP described in J4|. Namely, after performing a fixed point iteration of ( 12.21 ), the algorithm in (4| does 
not assign colors to all vertices (as Step 3 of BPCol does), but only to a small fraction (the most decisive ones with 
respect to the calculated values). Then, the algorithm performs another fixed point iteration, etc. The reason is that in 
the random graph model considered in [4] typically the number of proper 3-colorings is exponential in the number of 
vertices, whereas (d, 0.01)-regular graphs have only one 3-coloring (up to permutations of the colors). 

Remark 8. Let us discuss the essential differences between BPCol for k = 2 and the algorithm pseudo-bp 
analyzed in fT6l . 

1. In pseudo-bp the products in ( 12. U are taken over all neighbors of v, including w. This apparently minor 
modification has a major impact on the analysis. For including w causes the messages r/^ w to be independent 
of w. Consequently, in pseudo-bp the messages at time I are 2 \V\ -dimensional objects, whereas in the present 
work the dimension is 2fc|i?|. 

2. pseudo-bp actually works with the logarithms \n(rfZ^ w ) of the messages instead of the original Of 
course, the equation ( 12.11 ) can be phrased in terms of ln(^"_^. TO ) as \n(i)^^ w ) = F(ln(r]^ v )) u! zN(v) f° r some 
function F. Now, in pseudo-bp this non-linear function F is replaced by a truncated linear function F. 
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3 Proof of Theorem 3] 



3.1 Preliminaries and Notation 

Throughout this section, we let e > be a sufficiently small constant (whose value will be determined implicitly in the 
course of the proof). Moreover, we keep the assumptions from TheoremQ] Thus, we let d > do for a sufficiently large 
constant do; in particular, we assume that do > cxp(e~ 2 ). In addition, we assume that n > no for some sufficiently 
large number no = no(d), and that G = (V, E) is a (d, 0.01)-regular graph on n = \V\ vertices. This is reflected by 
the use of asymptotic notation in the analysis, which always refers to n being sufficiently large. 

Furthermore, we let (Vi, V2, V3) be a 3-coloring of G with respect to which the conditions Rl and R2 from the 
definition of (d, 0.01)-regularity hold. (Actually a (d, 0.01)-regular graph has a unique 3-coloring up to permutations 
of the color classes, but we will not use this fact.) The following easy observation will be used frequently. 

Lemma 9. Let i,j € {1, 2, 3}, i =/= j. Then in G each vertex v € V, has precisely d neighbors in Vj. Consequently, 
\N(v)\ = 2d. 

Proof. Assume w.l.o.g. that i = 1 and j — 2. By condition Rl £ = ly ( — ly is an eigenvector of the adja- 
cency matrix A(G) = (a V w)v,weV with eigenvalue —d. Hence, letting 77 = — d£ = A(G)£, we have — d = rj v = 

-TtweNWnVj a vw = -\N(v) nVj\. □ 

Following |4], we will denote the elements (v, w) s A as v — > w. Furthermore, we shall frequently work with the 
vector space 7Z = R 3 ® R" 4 . Each element T e 1Z has a unique representation 




with P = (Tl^y^tA G R A (i = 1, 2, 3). Hence, we shall denote such a vector as T = (T' l u ^ w ) v ^ weAAe{iat3} . 
Semantically, one can think of Y l v ^, w as the "message" that v sends to w about color i. Note that the messages (l) 
defined from Section|2]constitute vectors n(l) = {T)%-nvQ))v^>weA,ae{i,2,3} G T^-- 

We will denote the scalar product of vectors £ , 77 as (£, 77) . Moreover, 1 1 £ 1 1 = yj (£, £) denotes the ^2-norm. In 
addition, if M : R™ 1 — > R" 2 is linear, then we let || M \ \ = max^R^ 11 ^ m =1 1 1 M£ 1 1 signify the operator norm 
of M. Further, M T denotes the transpose of M, i.e., the unique linear operator R™ 2 — > R" 1 such that (M£, 77) = 
(£, M T n) for all £ e R™ 1 , 77 e R" 2 . 

3.2 Outline of the Analysis 

In order to analyze BP Col, we shall relate the fixed point iteration of ( 12.2b to the spectral coloring algorithm from 
Section [L2l More precisely, we will approximate the fixed point iteration of the non-linear operation (12. 2\ by a fixed 
point iteration for a linear operator. One of the key ingredients in the analysis is to show how symmetry is broken 
(i.e., convergence to the all-i fixed point is avoided). Indeed, it may not be clear a priori that this will happen at 
all, because the random bias generated in Step 1 of BPCol is uncorrected to the planted coloring. The analysis 
is based on the following crucial observation (cf. Corollary Q~2]below): after a logarithmic number of iterations, for 
all v £ Vi,w € Vj , i 7^ j the messages r]%_> w are dominated by eigenvectors of the linear operator which we use 
to approximate ( 12.21 ). Furthermore, these eigenvectors mirror the coloring (Vi, V2, V3) and are (almost) constant on 
every color class Vi (with basically 0,1,-1 values on the different color classes). Hence, the (random) initial bias gets 
amplified so that the planted 3-coloring can eventually be read out of the messages. 
To carry out this analysis precisely, we set 

Moreover, we let B : 1Z — > 1Z denote the (non-linear) operator defined by 

1 TT 1 — -F a 

(n T \a 1 lYu&N{v)\w x 2 1 u-*v . . 

[ ° >v ~" w ~ 3 v 3 n i - 3 r<> 1 

l^b=l llu£N(v)\w 1 2 1 u->u 
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Then ( 12.2b can be rephrased in terms of the vectors A(7) = (A%^ w (l)) v ^ W £A, oe{i,2,3} <= ^ as 

A(f+l)=BA(i). (3.1) 

We shall see that we can approximate the non-linear operator B in d3.lt by the following linear operator B' if 
II A(Z)||oo is small; the operator B' maps a vector T = (Tf^ w ) ae si t 2,3}, v ^w&A £ 7£ to the vector B'(T) = 
(B'(T)^ w ) a ^ w € K with entries 

B'{T)^ W = - 1 - J2 Tl+v + lY, E r »— (3-2) 

ueN(v)\w 6=1 u£N(v)\w 

Indeed, B' : 7Z ^ 7Z is just the total derivative of $ at 0. 

We define a sequence 3(1) by letting S(0) = A(0) and 3(1) = B' l 3(0) for I > 1, thinking of H(Z) as a "linear 
approximation" to A(l). As a first step, we shall simplify the operator B 1 a little. 

Lemma 10. We have (B'(3(l)))^ = -§ Eue^v 3 ?U(0 for all I > 0, v w g A a g {1,2,3}. 
Proof. Step 1 of BPCol ensures that the initial vector satisfies 

3 3 

^S b u ^(0) = ^A^(0) = forall{ U ,t,}e£ (cf. (Q). 

6=1 6=1 

Therefore, by induction and by the definition ( 13. 2t of £>' we see that J2b=i 3 u-»u(0 = for all I > 0. Consequently, 
J2b=i ~52ugn(v)\w 3 m— = f° r a ^ ' — ^> i- e -> the second summand on the r.h.s. of d3.2t vanishes. □ 

Due to LemmafTOl we may just replace B' by the simpler linear operator C : 1Z — » 7Z defined by 



(CT)U W = ~\ E r «— {1,2,3}), 



2 



(3.3) 



which satisfies 

5(0 =C l 3(0) =£'A(0). (3.4) 

We also note for future reference that 

3 

E E v-> w (0 = for all w «) g A i > 0, (3.5) 

a=l 

because d2.3t entails that d3.5l ) is true for I = 0, whence the definition d3.3t of C shows that d3.5l ) holds for all I > 0. 

In order to prove Theorem[T] we shall first analyze the sequence 3(1) and then bound the error 1 1 3(1) — A(l) 1 1 oo 
resulting from linearization. To study the sequence 3(1), we investigate the dominant eigenvalues of C and their 
corresponding eigenvectors. More precisely, we shall see that our assumption on the spectrum of the adjacency matrix 
A(G) implies that the dominant eigenvectors of C mirror a 3-coloring of G. We defer the proof of the following 
proposition to Section [33] 

Proposition 11. Let efj g 7Z be the vector with entries 

r a\h f 1 if b = a, v g Vi, and w G N(v) fl V,-, , . , . . ri „ o1 . , . N 

(e&)«U, = (o J otherw i se " (^t.e^a 1 M,jE{l 1 2,3} 1 ,^). 

Moreover, let £ be the space spanned by the 18 vectors efj (a,i,j £ {1, 2, 3}, i 7^ j). Then L operates on £ as follows. 
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51. There are precisely six linearly independent eigenvectors {C21C3 : a — 1, 2, 3} with eigenvalue A = | + 

j\/ d 2 — 8d + 4, which satisfy 

II <S - r a i2 ~ e? 3 + 621 + 4 3 II 00 < lOOd" 1 , || C 3 a ~ e? 2 - e? 3 + + eg 2 || «, < lOOd" 1 . (3.6) 

These eigenvectors are symmetric with respect to the colors a = 1, 2, 3, i.e., for any two distinct a, 6 G {1, 2, 3} 
ana" all v —> w £ A we have 

(C;T V ^ W = (C^ w , and (C?)^ = 0. (3.7) 

/n addition, 

II Call = HC^II /ora//j e {2 5 3},ae {1,2,3}. (3.8) 

52. 77ze f/iree vectors e a = Sj^j e # w ^ a = 1,2,3 are eigenvectors with eigenvalue \ — d. 

53. For all^eE such that £ ± {e a , (f : a = 1, 2, 3, j = 2, 3} we We 1 1 1 1 < 5 || £ 1 1 . 

54. Furthermore, ££ C £ ana" £ T £ C £. 
Finally, we have 

55. ||£ 2 £|| < 0.01d 2 ||£|| forall£L£. 



(3.10) 



The eigenvectors that we are mostly interested in are £f , Q (a = 1, 2, 3) as (13.6b shows that these vectors represent 
the coloring (Vi, V2, V3) completely. As a next step, we shall show that 2(Z) can be approximated well by a linear 
combination of the vectors £f , (3 '< provided that I is sufficiently large. To this end, let 

s? = VS- iiA^IHIff || = 2,3,o= 1,2,3) (3.9) 

be the projection of the initial vector A(0) = 2(0) onto the eigenvector Q; we shall see below that the normalization 
in d3.9t ensures that 1" is bounded away from 0. Furthermore, recalling from ( 13.81 ) that 1 1 Cf 1 1 = 1 1 C2 1 1 f° r a ^ h a > 
we set 

_ l|A(0)|| 

v^iiciir 

Corollary 12. Suppose that I > L\ = 2 [log n\, and that 2(0) _L e° for a = 1,2,3. Then 

3 3 

E^ w (l) = v\ l E + for all a G {1, 2, 3} and {v, w} G E. 

a=l i=2 

Proof. Since by assumption the initial vector 2(0) is perpendicular to e a for a = 1, 2, 3 and because e , e 2 , e 3 are 
eigenvectors of C by S2, we have 2(£) _L e a . Therefore, we can decompose 2(Z) as 

3 3 

m = £(0 + E E Z "(0C?, where £(Z) _L {e a , (? :i G {2, 3}, a G {1, 2, 3}}. (3.11) 

a=l i=2 

Thus, to prove the corollary we need to compute the numbers zf{l) and bound 1 1 £(£) 1 1 qq. 

With respect to the coefficients zf(l), note that zf(l) = A zf(0), because by SI Q is an eigenvector with eigen- 
value A. Moreover, zf (0) = || Cf II" 2 (2(0), Q). Hence, @M and d37TOb yield zf (0) = if • Thus, 

z?(Z) = AV • zf. (3.12) 

To bound the "error term" 1 1 £(Z) || we note that S3-S5 entail 

||£ 2 7 || <0.01d 2 || 7 || <(0.3A) 2 || 7 || forall 7 ±{e a ,C a :*e{2,3},aG{l,2,3}}, (3.13) 
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provided that d> do for a large enough constant do > 0. Let k = [l/2\. Since £(2fc) = £ 2fc £(0), ( 13.13b implies that 
||£(2fc)|| = \\£ 2k m\\ <(0.3A) 2fc ||£(0)|| <(0.3A) 2fe ||S(0)||. (3.14) 
Moreover, as I < 2k + 1 and \\C\\ < d - \ by PropositionfTTl (13.14b yields 

lieCOIloo < IKCOII <d||€(2*)|| <d(0.3A)'||S(0)||. (3.15) 

Finally, if / > L 1; then d(0.3A) z ||B(0) || = o(A ; z/). Thus, the assertion follows from (|3~TTT >. ( l3~T2l and ( f3TT3T >. □ 

While in the initial vector A(0) = 3(0) the messages are completely uncorrected with the coloring (Vi, V 2 , V3), 
Corollary [12] entails that the dominant contribution to 5(Li) comes from the eigenvectors Q, which represent that 
coloring. This implies that all vertices v in each class V a send essentially the same messages to all other vertices 
w G Vb about each of the colors 1,2,3, and these messages are solely determined by the initial projections xf of 
A(0) onto Q. Hence, after L\ iterations the messages are essentially coherent and strongly correlated to the planted 
coloring. Thus, as a next step we analyze the distribution of the projections xf. To simplify the expression resulting 
from Corollary [12] let 

yl = x a 2 + xl, y a 2 = -x a 2 , and y a 3 = -7%. (3.16) 
Then ( 13.6b and Corollary fT2lentail that for all v G Vi, all w G N(v), and I > L\ we have 

Z^ w (l) = (y? + o(l))-v\ l . 
Of course, the numbers yf only depend on the initial vector A(0). Therefore, we say that A(0) is feasible if 
Fl. A(0) _L e a for a = 1, 2, 3, and 
F2. for any pair a, b G {1, 2, 3}, a^iwe have 

\y a a - 1| < exp(-l/e) and \y b a + 0.5| < exp(-l/e). (3.17) 

Proposition 13. With probability ^(n^ 1 ) over the random bits used in Step 1 of BPCol A(0) is feasible. 

The elementary (though tedious) proof of Proposition [T3l can be found in Section |3~4l Combining Corollary [T2l 
and Proposition[T3~l we conclude that with probability (namely, if A(0) is feasible) we have 

0.49i/A* < || 3(0 II 00 < l-l^A j (l>Lx). (3.18) 

Having obtained a sufficient understanding of the sequence S(Z), we will now show that these vectors provide a 
good approximation to the vectors A(l), which we are actually interested in. The proof of the following proposition 
can be found in Section [3~5l 

Proposition 14. Suppose that A(0) is feasible. Let L 2 > be the maximum integer such that \ \ 5(^2) || 00 ^ e - Then 
HS^-A^IU^-Me). 1 1 5(^)112,. 

Combining the information on the sequence S(Z) provided by Corollary [T2] and Proposition[l3]with the bound on 
|| S(Z2) — A(^2) || 00 from Proposition [T4l we can show that the messages obtained in the next one or two steps of 
the algorithm already represent the coloring rather well. To be precise, let us call the vector r)(l) proper if 

Va G {1, 2, 3}, b G {1, 2, 3} \ {a}, v G V a , w G N(v) : ^(f) > 0.99 A r^JZ) < 0.01. 

Proposition 15. 7fA(0) is feasible, then for either L3 = L 2 + 1 or L3 = L 2 + 2 the vector i]{Lz) is proper. 

The proof of Proposition[T5lis the content of Section [3~!6l 

Proposition [T31 shows that the "rounding procedure" in Step 3 of BPCol applied to the messages r/(L 3 ) would 
yield the coloring (Vi, V 2 , V3). However, BPCol actually applies that rounding procedure to r)(l*), where I* > L 3 . 
Therefore, in order to show that BPCol outputs a proper 3-coloring, we need to show that these messages r](l*) are 
proper, too. 
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Lemma 16. Ifi~j{l) is proper, then so is r](l + 1). 

Proof. Let v G V a for some 1 < a < 3, w € N(v), and {b, c} = {1, 2, 3} \ {a}. Since r)(l) is proper, we have 

n > n i-^(o>o.99* 

uev c rw(w)\ii> ^ uev c rw(u)\u> 



(3.19) 



1 > r^) = 99 — . ^ 

Consequently, the definition (12. 2t of the sequence r)(l) shows that 

+ TT 1-^,(0 TT i-^->J0 

0.01 ■ [ 1^2£L ) > o.Ol • 90 2d > 1000. (3.21) 

V o.oi / - 

As the construction (12.21) of r](l + 1) ensures that r)l_ w Q + 1) + Vv^wi 1 + 1) + + 1) = 1, ( 13.211 ) entails that 

»7?-™ ( z + !) > °- 99 and »7*— tu (' + !)< 0.01, whence r/(7 + 1) is proper. □ 

Proof of Theorem\I\ Proposition [Til states that A(0) is feasible with probability f^n -1 ). Therefore, to establish 
TheoremQ] we show that BPCol outputs the coloring (Vi, V%, V3) if A(0) is feasible. 

Thus, assume that A(0) is feasible and let L2 be the maximum integer such that || ^(Lz) \ \ 00 < £■ Then Corol- 
lary [l2]implies that L2 = 0(log 3 n), because 1 1 3(0) 1 1 00 = S = cxp(— log 3 n), and the £oo-norm of S(Z) grows by a 
factor of A in each iteration. Therefore, Proposition[T5lentails that r]{Lz) is proper for some L3 = 9(log 3 n). Thus, by 
Lemma[T6lthe final r/(£*) generated in Step 2 is proper, whence Step 3 of BPCol outputs the coloring V\ , V2, V3. □ 

3.3 Proof of Proposition [H 

The operation ( 13.31 ) of C is symmetric with respect to the three colors a = 1,2,3. Therefore, we shall represent C 
as a tensor product of a 3 x 3 matrix and an operator that represents the graph G. To this end, we define operators 
M : R- 4 — > R" 4 and PC : R" 4 — * TL A by 

(AA'E,) V -> W = ^2 {fctyv^w = ^w^v (2 G R" 4 ). (3.22) 

u6W(i;) 



2d-l 



Thus, 



uEN(v)\w 



i.e., — \{M — JC) represents the operation of C with respect to a single color a e {1,2,3}. Therefore, we can rephrase 
the definition ( 13.3b of C on the space 7Z = R 3 ® R- 4 as 




£ = -- 1 (3.23) 



Hence, in order to understand C, we basically need to analyze A4 — K. 
Fori, j G {1, 2, 3} we define vectors ey G R" 4 by letting 



(eij) 



13 )v — >w 



1 if v G Vi, to G Vj, and w G N(v), 
otherwise. 



The following lemma shows that it makes sense to split the analysis of M. — K, into two parts: first we shall analyze 
how M. — K, operates on the space £q spanned by the vectors e.y (1 < i,j < 3, i ^ j); then, we will study the 
operation of M — K, on £ q . 
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Lemma 17. If£ £ £ , then Mt M T K£, JC T £_ £ So- 



Proof. Let i,j,k £ {1,2,3} be pairwise distinct. Since /Ce^ = eji, we have K,£o C £q. Moreover, K? 
Furthermore, by Lemma [9] 



u£N(v) 



d if v £ Vj, 
otherwise. 



K. 



(3.24) 



Hence, Adeij — d(ejk + eji), and thus Ai£o c £q. In addition, the transpose of Ai is given by 



(M T E) V ^ 



u£N(w) 



Therefore, 

Consequently, M T eij = d(e^ + ejfc), whence A4 t £q C fo- 



ri if v £ Vi, 
otherwise. 



□ 



To study the operation of Ai — K. on £q, note that ( 13.24b implies that (M. — K)eij = dejk + (d — l)eji, if 
hjt k £ {1, 2, 3} are pairwise distinct. Therefore, with respect to the basis ei2, e23, eai, e23, e3i, e32 of £o, we can 
represent the operation of Ai — K, on£o by the 6x6 matrix 



M 



( 


d-1 
d 




d-1 
d 





d-1 
d 







d 

d-1 



d \ 
d-1 
d 

d-1 








o 7 



Observe that M is not symmetric. Hence, a priori it is not clear that M is diagonalizable with real eigenvalues. 
Nevertheless, a (very tedious) direct computation yields the following. 



Lemma 18. The 6x6 matrix M is diagonalizable and has the non-zero eigenvalues 1, 2d — 1, 



_ d Vd 2 - 8d + 4 
~~ 2 2 1 



t . d Vd 2 - 8d + 4 

A' = 1 . 

2 2 



(3.25) 



77ie eigenspace with eigenvalue 2d — 1 is spanned by 1. Moreover, there are two mutually perpendicular eigenvectors 
C2 ! C3 wifA eigenvalue A, which satisfy 



C 2 - (1,1,-1,-1,0,0)^ 



< 



10 



C 3 - (1,1,0, 0,-1,-1)^100 < 



10 



«^iic 2 ii = nan- 

Since M describes the operation of A-l — K, on the subspace £q, Lemma[T8limplies the following. 

Corollary 19. Restricted to the subspace £q, the operator M. — K. is diagonalizable with non-zero eigenvalues 1, 
2d — 1, and A, A' as in ( I3.25D . The vector e* = e ij s P ans die eigenspace of 2d — 1. Furthermore, there are two 

mutually perpendicular eigenvectors £2, C3 with eigenvalue A, which satisfy 



I C2 - (ei2 + ei3 - e 2 i - e 2 3) || 00 < 



10 



C3 - (eia + ei3 - e 3 i - e 32 ) || 00 < 



10 
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Corollary[l9]describes the operation of Ai — K, on £ q completely. Therefore, as a next step we shall analyze how 
Ai — K, operates on £ ^. More precisely, our goal is to show that restricted to £$ the norm of Ai — K, is significantly 
smaller than A. To this end, we observe that the operator K, merely permutes the coordinates. Consequently, 

II A: || < 1. (3.26) 

To to bound the norm of Ai on £ we consider three subspaces of £q. The first subspace 5 consists of all vectors 
£ G £q such that the value £ v ^ w only depends on the "start vertex" v; in symbols, 

S = {£ G £q : Vij — > w, v — > u G A : £„->™ = £v->u}- 

If £ G S and v G V, then we let £„_> = £„_>«, for any id G N(v), i.e., £„_> is the "outgoing value" of v. 
The second subspace T consists of all £ G £q" such that £, u ^v depends only on the "target vertex" v, i.e., 

T = {£ G £q : Vu -> w, w -> u G .4 : ^„^„ = ^^}. 

For (eT and » e Vwe let £_,.„ = £„_>„ for any u G N(v), i.e., £_>„ signifies the "incoming value" of v. 

Furthermore, the third subspace U consists of all £ such that for any vertex the sum of the "incoming" values 
equals 0: 

U = J £ G : Vv G V : £ £„^ = 
[ ueN(v) 

Lemma 20. 1 . We have U = Kern(M) n £^ . 

2. Moreover, ;/£ G T, then {M£,)v-*w = 2d£—, v for all v — > w G A. In particular, Ai£ G S. 

3. Furthermore, T _L U, and Eq =T © U. 

Proof. The first assertion follows immediately from the definition (13.221 of Ai. Moreover, if £ G T, then (A4£)v-nv = 
2~2u£N(v) = \N(v)\£^ v = 2d£_,„ due to Lemma|9j whence 2. follows. Consequently, if £ G T and r) £ U, then 

(£,??) = ^2 Zu^vVu^v = 2d £^„ ^ r?„^=0, 
whence T _L [/. Furthermore, for any 7 G the vector 77 with entries 

uEN(w) 

Ues in T, because the sum on the r.h.s. is independent of v. In addition, £ = 7 — 77 satisfies 



«ejv(»;) 



Tu-n 

«eiV(D) 



2dr)^ v = for any «eF, 



so that £ G U. Hence, any 7 G £$ can be written as 7 = 77 + £ with r\ G T and £ G U, i.e., £q- = T $>U. □ 
By now we have all the prerequisites to analyze the operation of Ai on 8 q . 
Lemma 21. If^e£^-,then \\M 2 £\\ < O.Old 2 || £ || . 

Proof. Let £ G £q . By the third part of Lemma [20l there is a decomposition £ = £t + £ir such that £t G T and 
£(7 G f/. Furthermore, the first part of Lemma [20] entails that Ai£ = A4£t- Therefore, we may assume without loss 
of generality that £ = £t G T. Hence, the second part of of Lemmal20l implies that 

|| £' II = 2d II £|| (3.27) 
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and £' = MS, £ 5. Consequently, letting £" = A4£' = A4 2 £, we obtain 

£"->w = E = E (3.28) 

ugiV(t)) u£N(v) 

Since the r.h.s. of ( 13.281 ) is independent of w, we conclude £" £ 5. 

In order to bound 1 1 £" 1 1 = 1 1 A4 2 £ 1 1 , we shall express the sum on the r.h.s. of ( 13.281 ) in terms of the adjacency 
matrix A(G). To this end, consider the two vectors 

v' = (Vv)vev e R y with = 
i" = MWea v with < = 

for all v G V. Then 

II £'11 2 = E *Ut 2 = 2dX)£- 2=2d lh'll 2 ' and analogously (3.29) 

\\e\\ 2 = 2d || 77" || 2 . (3.30) 
Furthermore, d3.28t implies that 77" = J2 u eN(v) Vu f° r all u G V, i.e., 

Tf' = A(G)rf. (3.31) 
Combining ( EHTl i, 0291 , J3301 , and ( EHTT l, we obtain 

l|A«-{||-||fll- M|l y' -lltl|. 0-32) 

Hence, we finally need to bound 1 1 A(G)rf 1 1 . To this end, we shall employ our assumption that G is (d, 0.01)- 
regular; namely, condition R2 from the definition of (d, 0.01)-regularity entails that 1 1 A(G)( \ \ < O.OOld 1 1 £ 1 1 for 
all C -L lvi j ly 2 1 Ivs ■ Thus, we need to show that 77' _L \y i for z = 1,2,3. Assuming w.l.o.g. that i = 1, we have 

V',1 V1 ) = E^ = ( 2d ) _1 E ^ = (2d)- 1 (e' J ei2+e 13 ) 

= (2d)- 1 (Mtei2 + e 13 ) = {2d)~ 1 {^M T (e 12 +e 13 )). (3.33) 

Further, as A4 T (ei2 + ei 3 ) G £0 by Lemma [P71 while £ G £q by our assumption, ( 13.331 implies that (rf, ly 1 ^ = 0. 
Consequently, we obtain that 1 1 A(G)r)' 1 1 < O.OOld 1 1 77' 1 1 , whence ( 13.32b yields the assertion. □ 

Proof of Proposition^!]] Combining Corollary [T9lwith the tensor product representation ( 13.23b of C, we conclude that 
the six vectors 




(.7 = 2,3) (3.34) 

are eigenvectors of C with eigenvalue A = — ^A. In addition, the tensor representation d3.341 > of the vectors Q 
immediately implies the symmetry statement d3/7t , while (13-81 ) follows from Corollary [19] Moreover, once more by 
Corollary [19] the three vectors 




are eigenvalues with eigenvector — | (2d— 1) = | — d, and all other eigenvalues of C restricted to £ are < | in absolute 
value. In addition, Lemma [P71 shows in combination with (13.23b that ££, C T £ C £. Finally, Lemma |21| implies in 
combination with d3~23l that 1 1 £ 2 £ 1 1 < O.Old 2 1 1 £ 1 1 for all £ _L £. □ 
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3.4 Proof of Proposition H3l 

Before we get to the proof, let us briefly discuss why the assertion (i.e., Proposition[T3]l is plausible. In fact, let us point 
out that the vector A(0) is easily seen to satisfy F2 with probability f2(l). For each of the inner products (A(0), £f ) 
is a sum of n independent random variables, whence the central limit theorem implies that 

^HA^Il^HCf M- 1 <A(0),C?> 

is asymptotically normal (the factor || A(0) || _1 1| Q || _1 , which is independent of the random vector A(0), is 
needed to ensure that mean and variance are of order 8(1)). In fact, since the vectors (Q ) a =i.2,3; i=2,3 are mutually 
perpendicular, the joint distribution of the random variables 

(VS||A(0)||- 1 ||C?||- 1 (A(0) ) C?))i=a,3;a=l, 3 ,3 

is asymptotically a (multivariate) Gaussian. Therefore, the probability that A(0) satisfies F2 is f2(0). 

However, once we condition on A(0) satisfying Fl, the entries of A(0) are not independent anymore, whence 
the above argument does not yield a bound on the probability that A(0) satisfies both Fl and F2. Nonetheless, the 
dependence of the entries of A(0) is weak enough to allow for an elementary direct analysis. We begin with bounding 
the probability that A(0) satisfies Fl. To this end, we define a partition (Wi, W2, W3) of V by letting 

Wi = {v G V : Al^ w = S for all w G N(v)}; 

in other words, Wi consists of all vertices for which the random number a chosen in Step 1 of BPCol was equal to i. 

Lemma 22. The probability that A(0) satisfies Fl is f^n -1 ). 

Proof. A sufficient condition for A(0) to satisfy Fl is that Wi = W2 = W3 — Moreover, the total number of 
vectors that can be generated by Step 1 of BPCol equals 3™, out of which ( n y 3 n " 3 n ^ 3 ) yield W\ = Wi = W3 = |. 
Therefore, the assertion follows from Stirling's formula. □ 

In the remainder of this section we condition on the event that A(0) is such that W\ = Wi — W3. Thus, 
(W\, W2, W3), is just arandom partition of V into three classes of equal size, andforallu G Wi,allj G {l,2,3}\{i}, 
and all w G N(v) we have 

A 1 = S A J 



Lemma 23. For any constant c\ > there exists a constant ci > such that the following holds. If (s")i, a =i,2,3 are 

-2 



integers of absolute value \sf\ < C\\fn such that 53 ra =i s< j = St=i s \ = f or a ^ 1 ^ b, j < 3, then 



P 



Tl 

VI < a,i < 3 : \V a D Wi\ = - 

9 



> C2n" 



Proof. The sets W\, W2, W3 are randomly chosen mutually disjoint subsets of V of cardinality n/3 each, whereas 
Vi , V2 , V3 are fixed subsets of V. Therefore, the total number of ways to choose W\ , W2 , W3 is given by the multino- 
mial coefficient ( n ^ 3 n ™ 3 ra ; 3 ); by Stirling's formula, 

( " < 10n- x 3™. (3.35) 

\n/6, n/6, n/3J 

Moreover, the number of ways to choose W\, W2, W3 such that \V a fl Wj| = sf equals 

n ( n/3 ^ 

W in/9 + 4,n/9 + 4,n/9 + sg ' 
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(because the a'th factor on the r.h.s. equals the number of ways to partition V a into three pieces V a H Wi, V a fl W%, 
V a fl W3 of the desired sizes). Combining (13.35b and ( I3.36l l with Stirling's formula, we get 



VI <a,i <3: \V a n W i \ = -+s a i 



Furthermore, once more due to Stirling's formula, 



> 



> 



i(n/3)! 



1 3 



10'3 B ni< i ,a< 3 (»»/9 + *?) ! 

n 5/2+n 

10-(9e)»ni<i, o <3("/9 + s?)! 



(3.37) 



(n/9 + sf)! < exp(-n/9-s?)(n/9 + s?) n / 9+ <^ 

= exp(-n/9 - sf )(n/9) n/9+fl < (1 + 9s<7n) n/9+s * 
< exp(-n/9 + 9sf /n) (n/9) n/9+s «\ 

Since we are assuming that s? < c\ \fri and ^ i=1 s? = 0, ( 13 ,38b entails that 

[J (n/9 + <)! < (n/9e)"n 9 / 2 exp(9 ^ sf/n) < c' 2 (n/9e) n n 9 ^ 

Ki,o<3 a,i 



(3.38) 



(3.39) 



for a bounded number c 2 that depends only on c\. Finally, plugging ( 13.391 ) into ( 13.371 ) and cancelling, we obtain the 



assertion. 



□ 



Corollary 24. For any two constants C3,/? > there exists a constant C4 > such that the following holds. If 
(i")i,o=i,2,3 are numbers of absolute value < C3 5mc/z f/iaf J^o=i *i = Sj=l = 0/ or 1 < J < 3, f/ien 



Vl<a,i<3:|n-^(|Kn^|--)-i?|</3 



> C4. 



Proo/ Let # be the set of all tuples «) a ,;=i.2,3 of integers such that |n _ ^s^ — rf| < f3, and X)o=i s j = Z)*=i s i = 
for all 1 < b,j < 3. Then |5| > /3 4 n 2 /32. Moreover, all (sf) aji= i )2 ,3 G S satisfy |^| < (c 3 + l)Vn(l < b,j < 3). 
Therefore, Lemma|23] (applied with c\ = C3 + 1) shows that 

P 



Va,i:\n-*(\V a nWi\--)-1%\<0\ > ^ P |Vo,i : |K n Wi\ 

> c 2 |5|n- 2 > /3 4 c 2 /32, 



as desired. 



□ 



Since the vector A(0) just represents the partition W\, W2, W3, and the vectors e tj J ust represents the col 



oring Vi, V2, V3, Corollary l24leasilv implies a result on the joint distribution of the inner products ( A(0), ^2 



Corollary 25. For any two constants C5, 7 > there exists a constant cq > such that the following is true. Suppose 
that (zf)i< a ,i<3 are numbers such that \Zj\ < C5 and X)i=i z \ = ELi z j = ®f or a ^ 1 < J < 3. Then 



> C6. 



A(0),E^e« 

V °' * : \ Z t 11 A^n\ II II Aa II ' < 7 



l|A(0)|| || Cf 

Proof. The definition of 77(0) in Step 1 of BPCol shows that 



5 ifu€W a , 

-<5/2 otherwise. 



for all v — » i/J £ A 



(3.40) 
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Therefore, 



||A(0)|| = y/2,dn/2 ■ 8. (3.41) 
Moreover, by PropositionfTTI there is a number 0.99 < C7 < 1.01 such that 

1 1 C° 1 1 = c 7 1 1 e a 12 + e? 3 - e% x — e§ 3 1 1 = 2c 7 Vd^. (3.42) 



Furthermore, using d3.40t , we can easily compute the scalar product ^A(O), e ijJ 0- — a i * — 3): 

/a(o),^4\ = K^ w (o) = \v i nw a \-dS-\v i \w a \-^- 

Q If 

= ydVinW.I-n/9) [because |^| = |W„| = n/3]. (3.43) 
Combining (13 -4 li t. ( 13.42b . and ( I3.43l l, we conclude that for a certain constant c$ > 



^i=^ = .(\V i nW a \-n/9). 



I|A(0)|| ||Cf II v 

Therefore, the assertion follows from Corollary l24lbv setting s" = • zf and /? = j/cs- □ 

Proof of Proposition U 3\ Let a = exp(— 1/e) and 

V 2 otrlrwise (< = 2,3; « = 1,2,3). (3.44) 

Then the definitions d3.9t and ( 13.16b of the variables xf and yf entail that 

P[Va,z€ {1,2,3},^ a: \y a a -l\ <aA|tf-l/2| < a] 

> P\ia,ie {1, 2, 3} : \x? - xf \ < a/2] . (3.45) 

Therefore, we shall derive a lower bound on P [Va, i : \xf — xf\ < a/2]. 
To this end, let 

e?= E 4 (1 <«,«'< 3), 
je{i,2,3}\{»} 

and let V C R' 4 be the space spanned by these nine vectors. In addition, let q : R A — > V be the orthogonal projection 
onto V. Since the construction of the initial vector A(0) in Step 1 of BPCol ensures that A(0) € V, we have 

l|A(0)|| - Hffll = (A(0 ), Cf ) = ( g A(0),a = (A(0),gC a >- 

Hence, instead of the vectors we may work with their projections qQ onto V. Thus, let qfj £ R be the coefficients 
such that 

3 

</C Y. </:,<'; (1 = 2,3,0 = 1,2,3). 

Then by symmetry we have = q\j for all 1 < a, b < 3; therefore, we will briefly write q^ instead of qfj. 
Furthermore, d3.6l l implies the bounds 

0.99 < q 2 i < 1.01, -1.01 < Q22 < -0.99, -0.01 < q 2i < 0.01, (3.46) 
0.99 < gat < 1.01, -0.01 < q 3 2 < -0.01, -1.01 < 933 < -0.99. (3.47) 



17 



As a consequence, the matrix 

921 ?22 923 
Q = ( 931 932 933 
111 

is regular, and there is a constant eg > such that 1 1 Q _1 1 1 < eg. 
Let 



(a = 1,2,3). 



Since 1 1 Q 1 1 1 < eg and \x% \ < 1 for all a, i, we have 



\z?\<5cg (l<o,i<3). 



In addition, ( 13.44b and ( 13.481 ) imply that 

3 / zf 

E 4 

3 

E- 



a=l \ Zo 



3 / 



E 



a=l \ 

(1 < 6 < 3). 



= 0, and 



Combining ( l3.49b - d3.5U . we see that (zf )i< a ,i<3 satisfies the assumptions of Corollary |25j whence 

(A(0),e?) 



Vo,i : k°- 



A(0)|| ||C 



> c 6 



(3.48) 



(3.49) 



(3.50) 



(3.51) 



(3.52) 



for some constant C2 > 0. Furthermore, if A(0) G R A satisfies \zf — 1| &(a)\\ Tl'c" I ' — a2, t ' len an d the 

bounds ( I3.46M3.47I ) imply that 



v> || a(o) || || cil Vn \-\^[ z * 



A(0)|| ||g II J 1 

3 

< a 2 ^ 19^1 < 3a 2 < a/2 (j = 2, 3; a = 1, 2, 3). 

i=l 

Therefore, ( |3.52t yields P |Va, z : |x° — xf\ < a/2] > cq. Thus, the assertion follows from ( |3.45t and Lemmal22l □ 



3.5 Proof of Proposition [TH 

Our goal in this section is to bound the error || A(Z) — S(Z) || oo resulting from replacing the non-linear operator 
B by the linear operator C. Since A(Z) = #'A(0) and S(Z) = C l E(0) = C l A(0) by (E§, the main difficulty 
of this analysis is to bound how errors that were made early on in the sequence (i.e., for "small" I) amplify in the 
subsequent iterations. To control this phenomenon, we shall proceed by induction on I. We begin with a simple 
lemma that bounds the error occurring in a single iteration. Recall that the constructions of S(Z) and A(Z) ensure that 
ELi E v^ w V) = E«=i A^(Z) = for all v -> w £ A (cf. (fH and £3)). 

Lemma 26. Suppose that T satisfies Ea=i r 2-«> = for all v ^ w e A If \\T \\ ^ < O.OOIgT 1 , f/ien || ST - 

^criu < iood 2 ||r||^. 
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Proof. We employ the elementary inequalities 



exp(-x - x 2 ) < 1 - x < exp(-x) < 1 - x + x 2 (\x\ < 0.1). 



(3.53) 



Let v — > w G A, a G {1, 2, 3}, and set 

n fa = 



n i-? 



6 



(b G {1,2,3}). 



Moreover, let T = OT. Then we can rephrase the definition d3. lb of B as 



v — >w 



3 ELi n & 



(3.54) 



In order to prove the lemma, we shall bound the error term — (1 — | YlueN(v)\w To ^is end, note 

that by ( 13.531 ) there exist numbers < a b u ^ v < 9/4 such that 1 - §r*_,„ = e xp(-|r^^ - a^^J,^). Hence, 
once more by ( 13.53b there is a number — 1 < (3 b < 1 such that 



II5 = exp 



E_r b 4- n b V b2 
r\ U — >V ' U — >V U — >V 



uGN(v)\w 



1- E 

Lb + Eb, 

1 E ^ 



— r fc 4- n b v b2 

cy U — >V "T" — >'U U — >V 



+ /3 b 



-, 2 



E-r 6 4- (T fc r b2 
2 u — >v ' u — >u u — >u 

u&N(v) 



where we let 
and 



(3.55) 



u£N(v)\w 



E-V b 4- rv b V b2 
rj U — >V ' U — >V U — >V 



uGN(v) 



Further, since || T \\ oo < 0.001/(iby assumption and |iV(u)| = 2d by Lemma|9] we obtain the bound 

\E b \ < iod 2 ||r||^ <o.oi. 



(3.56) 



As Y^l=i Lb = 3 due to our assumption that J2b=i ^u^v = for all m ^ w e A, plugging d3.55l l into ( 13.541 ) 



yields 



v — >w 



1 



L a + E a 



L a 3E a + L a (E 1 +E 2 + E 3 ) 



3 3 + E 1 +E 2 + E 3 3 

Since \L a \ < 1 + 3d \ \ T \ \ x < 2, d336t and dlBTl i yield that 

1 



3(3 + E 1 +E 2 + E 3 



j(l-L a )-TS^ w \<100d'\\T\\i o . 
Finally, a glance at d3.3l l reveals that (CT)^ W = |(1 — L a ), and thus the assertion follows from ( 13.581 ). 



(3.57) 



(3.58) 



□ 



Lemma l26l allows us to bound the error || A(^ + 1) — S(Z + 1) || 00 resulting from iteration I + 1 in terms of the 
error 1 1 A(Z) — S(Z) 1 1 ^ from the previous iteration. In the sequel we let C > denote a sufficiently large constant. 

Lemma 27. Suppose that \\A(l) - 3(1) || «, < (Cd)" 1 . Then 

\\A(l + l)-E(l + l)\\ 00 <2Cd 2 \\E(l)\\ 2 OD +4d\\A(l)-E(l)\\ 00 . 
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Proof. By Lemma [26] and the definition ( 13.3b of C we have 

|| A(Z + 1)-3(Z + l)||oo = ||BA(Z)-£3(Z)||oo 

< ||BA(0-£A(0lloo+ ||£A(0-/:3(0IU 

< Cd 2 || A(Z) || 2o+2dI|A(0-S(0IIoo. (3-59) 

Moreover, 1 1 A(i) 1 1 «, < 1 1 3(1) 1 1 «, + 1 1 E(Z) - A(J) 1 1 oo, whence yields 

|| A(l + 1) - 3(Z + 1) || oo < 2Cd 2 [|| 3(0 || I + || 5(0 - A(0 || I] + 2d\\ A(i) - S(Z) || 
This implies the assertion, because we are assuming that 1 1 A(0 — 5(0 1 1 oo < (Cd)^ 1 . □ 
Further, applying Lemmal27lL times recursively, we obtain the following bound. 
Corollary 28. Suppose that || A(Z) - 5(0 || «> < (CcfT 1 for all I < L. Then 

L-l 

1 1 A(L) - 3(L) 1 1 oc < 2Cd 2 ]T W" 1 1 1 S(L - j ) 1 1 1 + Cd 2 (4d) L " x 1 1 A(0) 1 1 L . 

To proceed, we need the following (rough) absolute bound on the error 1 1 A(L) — 5(i) 1 1 oo. 
Lemma 29. 7/i < log 2 n, then \\ A(L) - 3(£) || «, < (Cd)" 1 . 

Proof. The proof is by induction on Z. For L = the assertion is trivially true. Thus, assume that 1 1 A(Z) — S(Z) 1 1 ^ < 
(Cd)" 1 for all I < L < log 2 n. Then Corollary |28]entails that 

L-l 

|| A(L) - S(L) || oo < 2Cd 2 ]T(4dr 1 || 5(L - j) II L + || A(0) || 2 oc . 

Further, the definition ( 13.3b of £ shows that 

||5(Z)|| oo <(2d) z ||A(0)|| oo = (2d) / <5. 

Hence, 

|| A(L) - S(L) || oo < ±Cd 2 (2d) 2L - 2 S 2 + Cd 2 (Ad) L - 1 5 2 . 

As S < cxp(— log 3 n) and d = 0(1), the r.h.s. is o(l) asm oo, and thus 1 1 A(L) — 5(i) 1 1 oo < (Cd) -1 , provided 
that n is sufficiently large. □ 

Lemma 30. Let L* be the maximum integer such that || 5(L*) || oo < (..Then for all log 2 n < L < L* we have 
||S(L)-A(L)||oo<-log( e )- mL)\\l. 

Proof. By the definition d3.3l l of C there are constants c x , c 2 > such that 

1 1 5(0 | |oo < {2d) 1 6 (VZ< C2 logn), (3.60) 
II 5(0 1| oo € cl x X l 8/\/dn,c 1 X l 5/Vdn (VZ>c 2 logn). (3.61) 

We proceed inductively for log 2 n < L < L*. Thus, assume that 1 1 3(Z) - A(Z) 1 1 oo < ci 1 1 5(Z) 1 1 ^ for all log 2 n < 
I < L. Since A > O.ld and 1 1 5(i) 1 1 ^ < e, this implies that 

1 1 5(0 - A(Z) || < (Cd)- 1 for all log 2 n < I < L. 

Furthermore, 1 1 5(Z) — A(Z) 1 1 oo < (Cd) -1 for all I < log 2 n by Lemma|29l Therefore, we can apply Corollary |28lto 
obtain 

L-l 

||3(£)-A(L)||oo < 2Cd 2 Y,(4dy- 1 mL-3)\\l + Cd 2 (4d) L - 1 \\A(0)\\ 2 oo . (3.62) 



20 



Since L > log 2 n and A > O.ld, ( 13.601 ) and (13.61b imply that the sum on the r.h.s. of ( 13.62b is dominated by the term 
for j = L — 1. Hence, 



< c 3 d 2 S 2 [n- 1 ^- 2 + (4d) i - 1 ] 

< 2c 3 d 2 S 2 \ 2L - 2 n- 1 < c 4 S 2 \ 2L n~\ 



(3.63) 



Combining ( 13.611 ) and ( 13.631 ), we conclude that 1 1 S(L) — A(i) 1 1 oo < — log(e) • 1 1 S(L) 1 1 ^ (provided that e is chosen 
small enough). □ 

Finally, Proposition[l4]follows from Lemma[30l directly. 



3.6 Proof of Proposition [15] 

Let n = v\ L2 . Then Corollary fT2l and Proposition [T3l entail that 

(1 - e 3 )n < A^ W {L 2 ) < (1 + e 3 )^ if v € V a and w G N{v), and 
(-i-e 3 )^ < A^ W (L 2 ) < (-i + e 3 )/i if v # V a and w e 



(3.64) 
(3.65) 



To prove Proposition[T5l we consider two cases. The first case is that 1 1 A(L 2 ) 1 1 oo < ( £ rf) _1 is "small". Then it will 
take two more steps for the messages to properly represent the coloring (Vi, V 2 , V3), i.e., L3 = L 2 + 2. By contrast, 
if I j A(L 2 ) 1 1 00 > is "large", we will just need one more step (L3 = L 2 + 1). In both cases the proof is based 

on a direct analysis of the BP equations ( 12.21 ). 



Lemma 31. 7/O.Olecr 1 < || A(Z 2 ) || 00 < (ecTT 1 , then 



i + (l + 7(u, «,i))/9 j/ueFi, 
I — (1 + 7(u, w, i))/?' otherwise, 



(3.66) 



where \^{u,v,i)\ < e 3 ana? /3, /?' > e 2 . 
Proof. We have 

vt^ w ( L 2 + 1) 



IlueJV(D)\u) 1 |A^„(-£< 2 ) 



£ U ex P ("I £»eAr( )\« A^„(L 2 ) + 0(Ai^ v (L 2 )f 



3 

^exp 

3=1 



u£N(v)\w 



(3.67) 



Since for any v we have |-/V(w)| = 2d, we can essentially neglect the O(ed) 2 -term in ( 13.67b . More precisely, for 
some — e 2 < 72 = 7 2 (*, w) < e 2 we have 



VU W (L 2 + !) = (! + 72) 



3 

£ exp 



- £ aU(£2)-aL(i 2 



u£N(v)\w 



(3.68) 



To analyze (13.68b , assume without loss of generality that v € V\. Then ( 13.64b and ( 13.65b entail that there is a 
number — e 2 < 73 < e 2 such that 

]T A^^-A 2 ^^) = -(1+TsW 
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Consequently, rf\^, w {L 2 + 1) = (1 + 72) [1 + 2 exp (—(3/2 + 73)^/1)] . Finally, since /1 < 2(ed) \ we obtain 

3 



ril^ w (L 2 + l) = (I + 74) 



1 + 2 exp [ ——dfi 



(3.69) 



for some — 2e 2 < 74 = 74(1, v, w) < 2e 2 . 

Now, assume that v £ V 2 . Then ( 13.641 and ( 13.651 entail that there are numbers — e 2 < 75, < e 2 such that 

A^(L 2 )-A^(i 2 ) = 75 d M) 

BejV(«)\io 

£ A^(L 2 )-A 2 _,(L 2 ) = (3/2 + 76 )^. 

weJV(w)\'!« 



Therefore, 



r7^ w (i 2 + l) = (I + 74) 



2 + exp ( — d/i 



for some — 2e 2 < 74 = 74(2, w, u>) < 2e 2 . Combining ( 13.69b and ( 13.70b , we obtain the assertion. 



(3.70) 
□ 



Corollary 32. Suppose that. O.OleoT 1 < HA^Hoc < (ed) -1 . Then n$_> w {L 2 + 2) > 0.99 G K, W 
77^(L 2 + 2) <O.Oljrw0V o . 

Proof. We assume without loss of generality that a = 1. Moreover, suppose that wsFi. We shall bound the quotient 

7^jL 2 + 2) 



% 2 ^ u (£2 + 2) 

Qi 



= Q2 ■ Q3, where 

- n I:fc;£> "= 2 ^ 

u€Vjr\N(v)\w lu ^ vV ' 



(3.71) 



from below. Lemma [5T| implies that for u G V3 

1 - 



Vl^ 2 + l) 2/3 +(1-^ 

l- ?/ 2 _(L 2 + l) " 2/3+(l + e 3 )/3 ^ 1 + 3e ^ ^ 6e - 



Hence, 



3\d 



Qa > (1 - 6e d ) 



(3.72) 



Furthermore, for u G Lemma[3T|entails that 



l-^U(i2 + l) > 2/3 + (1 - = (1 - e 3 )(/? + ff) > 1 ^ 2 



1-^(L 2 + 1) " 2/3-(l + e3)/3 



2/3 -(1 -£»)/? 



Consequently, 



J2\d-1 



Q 2 > (l + 2e 2 ) c 

Combining (13.72b and (13.73b and recalling that d ^> e~ 2 , we obtain the assertion. 



(3.73) 
□ 



Lemma 33. Suppose that HA^Hoc > (ed) -1 . Thenr]^ w (L 2 + l) > 0.99 if v G V a , and ri%_> w (L 2 + 2) < 0.01 



Proo/ Since || A(i 2 ) || 00 > M)" 1 , ESS and (fX65l yield 

ft > {2ed)- 1 . 



(3.74) 
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Without loss of generality we may consider a vertex v £ V\ and a neighbor w £ N(v). We will prove that ?7j_ > . lu (1/2 + 
l)/rj%_ yw (L2 + 1) > 1000. Since Ylj=i vi-> w (-^'2 + 1) = 1> this implies the assertion. To bound the quotient from 
below, we decompose 

^ w {L2 + l) = g 2 .g here (3?5) 

n TT 1 vl^v (^2 ) . . „ 

^ = 11 (£2) f °^= 2 ' 3 ' 

With respect to Q 3 , ( 13.641 ) and ( 13.651 ) imply that for u £ V3 

1-^(^2) > 2/3+(l/2-e 3 )» 2e 3 // 3 

l-^(ia) " 2/3+(l/2 + e> 2/3 + (1/2 + e*)» - 

Hence, 

Os > (1 - 3eV) d - (3.76) 

Further, ( 13.641 ) and ( 13.651 ) yield that for u £ V2 

1-^,(^2) 2/3+(l/2-e 3 ) /J ) (3/2-2e 3 )/i 

l-^„(L a ) " 2/3-(l-e3)/i + 2/3+(l/2-(l- e 3)) At - 

Therefore, 

Q 2 > (l + 2fi) d - 1 . (3.77) 

Thus, combining ( I3.74l i- (I3.77| |. we obtain 

V r wi f L T 2 ^]] = Q 2 -Q 3 >(l-3e 3 M) ,i (l + 2M) <i - 1 >(l + M) d - 1 > 1000, 

Vv-,w( L 2 + 1) 

which implies the assertion. □ 
Finally, Proposition[T5lis a direct consequence of Corollary 1321 and Lemma[33l 



4 Proof of Corollary |2] 

Throughout this section, we assume that d> do for a sufficiently large constant do > 0, and that n > no = tiq (d) for 
a large enough no- Set p = d/n. 

Let G = G„.d,3 be a random graph with vertex set V = {1, . . . , 3n} and "planted" 3-coloring Vi, V2, V3. In order 
to analyze the adjacency A(G), we shall employ the following lemma, which follows immediately from the "converse 
expander mixing lemma" from |01 . 

Lemma 34. Let B = (V'U V", Eb) be a bipartite d-regular graph such that \ V'\ = \V"\. Assume that 

VS cF'.Tc V" : \e B (S,T) - \S\\T\p\ < d - 51 y/\S\F\, (4.1) 
where es(S, T) is the number of S-T -edges in B. Then the adjacency matrix A(B) enjoys the property: 

for any two vectors £, 77 £ H v uV such that both £, 77 are perpendicular to ly and ly> the inequality 
(A(B)£,r)) < d°- 52 \\^\\ || 77 1| holds. 

Moreover, the following lemma can be derived using standard techniques from the theory of random regular graphs [Chap- 
ter 9[9). 
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Lemma 35. W.h.p. G has the following property. Let 1 < i < j ' < 3. Then 



VS C Vi, T c Y, : |e G (S,T) - \S\\T\p\ < d - 51 ^/\S\fF\. 
Corollary 36. W.h.p. G is (d,0.01)-regular. 

Proof. Let A(G) = (av,w)v,weV denote the adjacency matrix of G. Moreover, let 

if v,w e Vi u y,-, , . 

otherwise \ — 1 3 — )■ 

Then A y = (a^Jn.ujy is the adjacency matrix of the bipartite subgraph of G induced on Vi U Vj. Let £ be the 
subspace of R v spanned by the three vectors ly k (k = 1,2, 3). Combining Lemma [341 with Lemma [351 we conclude 
that w.h.p. (A^i, n) < d 052 \\ £ \\ \\r]\\ for all £,7? _L 8 and any 1 < i < j < 3. Since A(G) = Ei<i<j<3 this 
implies that 

y^V±£: (A(G)Z,rt) <0.01d||£|| [ | r? 1 1 (4-2) 

(provided that <i is sufficiently large). Furthermore, as the construction of G ensures that each vertex uGl^ has exactly 
d neighbors in each class Vj ^ Vi, we can compute the vector = A(G)lvt as follows: for any v £ V 



c = E 



o ifi- e K, 

d ifu^y. 



w£Vi 

Hence, Q % = A(G)lvi = d lv5 ■ Therefore, for any 1 < i < j < 3 we have 

A(G)(l Vi -l Vj ) = -d(l Vi - lv 3 ). (4.3) 
Combining ( 14.21 ) and ( 14.31 ), we see that G is (d, 0.01)-regular w.h.p. □ 
Finally, Corollary [2] follows from Theorem[T]and Corollary [361 



5 Conclusion 

We have shown that BPCol 3-colors (d, 0.01)-regular graphs in polynomial time. Three potentially interesting exten- 
sions suggest themselves, which may be the subject of future work. 

1. In (d, 0.01)-regular graphs every vertex has precisely d neighbors in each color class except for its own. By 
comparison, in the planted random graph model studied in J2) the number that a vertex has in another color 
class is Poisson with mean d. It would be interesting to see if/how the present analysis can be modified to deal 
with such a more irregular degree distribution. 

2. Survey Propagation ("SP") is a more involved version of Belief Propagation (although SP can be rephrased 
as BP on a different model lfT3l ) and performs very well empirically on random graphs G(n,p). It would be 
interesting to extend our analysis to SP. 

3. In a (d, 0.01)-regular graph there is exactly one 3-coloring (up to permutations of the color classes). Nonetheless, 
we think that the techniques of our analysis can be extended to more complicated "solution spaces". For instance, 
it should be straightforward to deal with graphs that have a bounded number of distinct 3-colorings. 
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